86 research outputs found

Dynamic, Task-Related and Demand-Driven Scene Representation

Author: AL Yarbus
CA Rothkopf
D Soto
DH Ballard
FH Hamker
J Ferrante
J Triesch
JK Tsotsos
JM Henderson
Julian Eggert
L Itti
M Corbetta
M Hayhoe
MA Just
MM Chun
R Sedgewick
RA Rensink
S Amari
S Frintrop
S Ullman
SP Lloyd
Sven Rebhan
V Navalpakkam
Y Aloimonos
Publication venue: Springer-Verlag
Publication date: 01/01/2010

Humans selectively process and store details about the vicinity based on their knowledge about the scene, the world, and their current task. In doing so, only those pieces of information that are required for solving a given task are extracted from the visual scene. In this paper, we present a flexible system architecture along with a control mechanism that allows for a task-dependent representation of a visual scene. Contrary to existing approaches, our system is able to acquire information selectively according to the demands of the given task and based on the system’s knowledge. The proposed control mechanism decides which properties need to be extracted and how the independent processing modules should be combined, based on the knowledge stored in the system’s long-term memory. Additionally, it ensures that algorithmic dependencies between processing modules are resolved automatically, utilizing procedural knowledge that is also stored in the long-term memory. By evaluating a proof-of-concept implementation on a real-world table scene, we show that, while solving the given task, the amount of data processed and stored by the system is considerably lower than in the processing regimes used in state-of-the-art systems. Furthermore, our system only acquires and stores the minimal set of information that is relevant for solving the given task.
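
The automatic resolution of dependencies between processing modules mentioned in this abstract can be illustrated with a topological sort. The following is only a minimal sketch and not the authors' implementation: the module names, the `DEPENDENCIES` table standing in for procedural knowledge, and the `plan` function are all hypothetical.

```python
from graphlib import TopologicalSorter  # standard library, Python 3.9+

# Hypothetical procedural knowledge: each processing module maps to the
# set of modules whose output it needs before it can run.
DEPENDENCIES = {
    "retinal_location": set(),
    "segmentation": {"retinal_location"},
    "color": {"segmentation"},
    "depth": {"retinal_location"},
    "physical_size": {"segmentation", "depth"},
}


def plan(requested):
    """Return only the modules needed for `requested`, in a runnable order."""
    needed = set()
    stack = list(requested)
    # Collect the requested modules and everything they depend on.
    while stack:
        module = stack.pop()
        if module not in needed:
            needed.add(module)
            stack.extend(DEPENDENCIES[module])
    # Order the needed modules so that prerequisites always come first.
    graph = {module: DEPENDENCIES[module] for module in needed}
    return list(TopologicalSorter(graph).static_order())
```

In this sketch, a task that only asks for `"color"` never triggers the hypothetical `"depth"` module, which mirrors the idea of acquiring only task-relevant information.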

Crossref

Springer - Publisher Connector

PubMed Central

Disambiguating Multi–Modal Scene Representations Using Perceptual Grouping Constraints

Author: A Baumberg
A Sha'ashua
A Verri
C Harris
C Schmid
D Crevier
D Field
D Kraft
D Lowe
D Scharstein
E Baseski
E Brunswik
F Schaffalitzky
Florentin Wörgötter
HH Nagel
J Elder
J Koenderink
J Mayhew
J Rodrigues
J Shi
K Koffka
K Köhler
K Mikolajczyk
L van Gool
L Wolff
M Brown
M Felsberg
M Oram
M Popović
N Kim
N Krüger
N Pugeault
Nicolas Pugeault
Norbert Krüger
O Faugeras
P Kovesi
P König
P Parent
P Perona
R Chung
R Hartley
R Horaud
R Mohan
S Geman
S Sarkar
S Se
SH Lee
Teresa Serrano-Gotarredona
W Freeman
W Geisler
Y Aloimonos
Y Ohta
Publication venue: Public Library of Science
Publication date: 01/01/2010

In its early stages, the visual system suffers from a lot of ambiguity and noise that severely limit the performance of early vision algorithms. This article presents feedback mechanisms between early visual processes, such as perceptual grouping, stereopsis and depth reconstruction, that allow the system to reduce this ambiguity and improve early representation of visual information. In the first part, the article proposes a local perceptual grouping algorithm that, in addition to commonly used geometric information, makes use of a novel multi–modal measure between local edge/line features. The grouping information is then used to: 1) disambiguate stereopsis by enforcing that stereo matches preserve groups; and 2) correct the reconstruction error due to the image pixel sampling using a linear interpolation over the groups. The integration of mutual feedback between early vision processes is shown to considerably reduce ambiguity and noise without the need for global constraints.

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Open Research Exeter

GoeScholar The Publication Server of the Georg-August-Universität Göttingen

Enlighten

University of Southern Denmark Research Output

Surrey Research Insight

Direct computation of shape cues using scale-adapted spatial derivative operators

Author: A. Blake
A.L. Yuille
A.P. Pentland
A.P. Witkin
A.R. Rao
B. Julesz
B. O'Neill
B. Rogers
C. Blakemore
C. Tyler
D. Blostein
D. Marr
D.C. Marr
D.J. Field
J. Babaud
J. Bergen
J. Bigün
J. Gibson
J. Gårding
J. Jones
J. Malik
J.J. Koenderink
Jonas Gårding
K. Kanatani
K.V. Mardia
L. Davis
L.G. Brown
L.M.J. Florack
M. Turner
P. Bijl
R.A. Young
R.P. Wildes
S.B. Pollard
T. Caelli
T. Lindeberg
Tony Lindeberg
Y. Aloimonos
Publication venue: Springer Science and Business Media LLC

Crossref

Introduction: Active Vision Revisited

Author: Y. Aloimonos
Publication venue: Publishers

CiteSeerX

Real-time Sound Source Localization and Separation based on Active Audio-Visual Integration

Author: Y. Aloimonos
Publication venue: Springer Science and Business Media LLC

Crossref

Qualitative Egomotion

Author: C. Fermüller
Y. Aloimonos
Yiannis Aloimonos

Due to the aperture problem, the only general unambiguous motion measurement in images is normal flow, the projection of image motion on the gradient direction. In this paper we show how a monocular observer can estimate its 3D motion relative to the scene by using normal flow measurements in a global and mostly qualitative way. The problem is addressed through a search technique. By checking constraints imposed by 3D motion parameters on the normal flow field, the possible space of solutions is gradually reduced. In the four modules that comprise the solution, constraints of increasing restriction are considered, culminating in testing every single normal flow value for its consistency with a set of motion parameters. The fact that motion is rigid defines geometric relations between certain values of the normal flow field. The selected values form patterns in the image plane that are dependent on only some of the motion parameters. These patterns, which are determined by the signs of the normal flow values, are searched for in order to find the axes of translation and rotation. The third rotational component is computed from normal flow vectors that are only due to rotational motion. Finally, by looking at the complete data set, all solutions that cannot give rise to the given normal flow field are discarded from the solution space.
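
The notion of normal flow used in this abstract can be made concrete for a single pixel. The sketch below is an illustration and not code from the paper: it assumes the standard brightness constancy constraint `ix*u + iy*v + it = 0`, which the abstract does not state, and the function names `normal_flow` and `project_on_gradient` are hypothetical.

```python
import math


def normal_flow(ix, iy, it, eps=1e-9):
    """Normal flow vector at one pixel.

    ix, iy: spatial image derivatives; it: temporal derivative.
    Assumes brightness constancy (ix*u + iy*v + it = 0).
    Returns None where the gradient is too weak to define a direction.
    """
    g2 = ix * ix + iy * iy
    if g2 < eps:
        return None
    scale = -it / g2
    return (scale * ix, scale * iy)


def project_on_gradient(u, v, ix, iy):
    """Project a full image motion vector (u, v) onto the gradient direction."""
    g2 = ix * ix + iy * iy
    dot = u * ix + v * iy
    return (dot * ix / g2, dot * iy / g2)


# Illustrative values only: a true motion (u, v) and a local gradient (ix, iy).
u, v = 2.0, 1.0
ix, iy = 3.0, 4.0
it = -(ix * u + iy * v)  # temporal derivative implied by brightness constancy

measured = normal_flow(ix, iy, it)
projected = project_on_gradient(u, v, ix, iy)
```

Under the stated assumption, `measured` and `projected` agree, while the component of `(u, v)` perpendicular to the gradient is not recoverable from local derivatives alone, which is the aperture problem the abstract refers to.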

CiteSeerX